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Summary 

An Arabidopsis thaiiana cDNA bank has been consti- 
tuted in a Saccharomyces cerevisiae expression vector 
based on the phosphogly cerate kinase (PGK) promoter 
and terminator. This bank was used to complement 
eight S. cerevisiae auxotrophic markers. All of them 
were corrected. These results confirm the quality of 
the bank and the feasibility of cloning plant genes by 
yeast mutant complementation. The cDNA comple- 
menting the ural yeast mutant was sequenced, 
analysed and shown to encode a dihydroorotic (DHO) 
dehydrogenase sequence. 

Introduction 

Since the cloning of a human cDNA by direct complementa- 
tion of the fission yeast Sctiizosacctiaromyces pombe 
(Lee and Nurse, 1987). many other genes have been 
obtained by this approach (see Becker ef a/., 1991; Lew 
et a/.. 1991 and references therein; Minet and Lacroute. 
1990; Schild et a/., 1990). There are two main limitations 
to this technique. First, only the basic functions of a 
eukaryotic cell are expressed in yeast, which excludes 
cloning the genes involved in differentiation. Secondly, 
many proteins function in narrow interaction with neigh- 
bouring proteins or nucleic acids (RNA polymerase subunits. 
ribosomal proteins, proteins involved in RNA splicing, 
etc.). In this case, heterospecific complementation is 
unlikely to function effectively in organisms which have 
widely diverged in evolution. We have prepared an 
A. thaliana cDNA bank in a yeast expression vector to test 
if the yeast complementation cloning approach, which 
was successful with mammalian genes, would work equally 
well with plants. 

Results 

Construction of the yeast vector 

We previously constructed pFL60 (Minet and Lacroute, 
1990), a yeast cDNA cloning vector adapted to the direct 
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cloning of cDNAs on the oligo(dT) tail borne by the vector 
itself (see Okayama and Berg, 1 982). This vector contains 
the phosphoglycerate kinase (PGK) promoter separated 
from its terminator by a multisite polyiini<er, the yeast 
URA3 gene, a small part of the 2 \Lvr\ yeast plasmid bearing 
the replication origin and the £ coli vector pUC19. 
An oligo{dT) adaptor subsequently had to be introduced 
in a unidirectional manner. As the Okayama and Berg 
method gave us too few clones, we decided to adapt 
pFL60 for the use of the SsfXl double-site digestion 
technique which creates incompatible cohesive ends on 
the vector (Amffo and Seed. 1987). A Not\ linker was 
introduced in place of the aa\-Bgl\\ polylinker of pFL60 
and a new polylinker was subsequently introduced in the 
A/of 1 site with the sequence given in Figure 1 . This new 
vector was named pFL61 . 

Preparation of the cDNA bank 

Complete young A. thaliana seedlings (stage two leaves), 
including roots, were frozen in liquid nitrogen and ground 
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Figure 1. Restriction map of the pFL61 yeast expression vector and 
sequence of the polylinker introduced in the A/of I site. 
The restriction sites indicated are one cut sites; the C/al and BgtW sites 
were used for the construction of pFL61. BsfXI sites are the insertion sites 
for the cDNA poo!. 
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in a mortar with frozen TE buffer with 5% SDS. The 
resulting powder was melted, extracted twice with an 
equal volume of phenol-chloroform and twice again after 
addition of 1/10 volume of 4 M NaCL Nucleic acids were 
precipitated with two volumes of ethanol and immediately 
enriched in poly(A)^ RNAs by chromatography on an 
oligo(dT) column. After elution and ethanol precipitation, 
the poly(A)^ RNAs were resuspended in water, quantified 
by UV absorbance and immediately frozen at -80X until 
use. The cDNAs were prepared from 5 of poly (A) ^ 
RNAs with an Amersham 'cDNA plus' cloning kit, using 
AMV reverse transcriptase for the synthesis of the first 
strand and RNase H and the Klenow fragment of E coli 
DNA polymerase for the second strand. To quantify sub- 
sequent cDNA recovery, a small amount (1 jxCi) of radio- 
active dCTP was added during the synthesis of the second 
strand. After blunt-end conversion, the cDNAs were iigated 
to DNA adaptors with CACA cohesive ends compatible 
with the GTGT ends produced In the vector by BstXi 
digestion. The cDNAs were then separated by size on a 
low melting point agarose gel which also removed the 
excess DNA adaptors. Three size classes were recovered 
from the gel: one from 0,2 to 1 kb, one from 1 to 2.5 kb and 
the last one from 2.5 kb to the top of the gel. The calculated 
recovery (as a percentage of the total radioactivity) of the 
cDNAs was 33% in the short class. 22% in the medium 
class and 1 0% in the large class; a rough evaluation of the 
corresponding amounts of cDNAs gave, respectively, 
1 .5 ^Q, 1 p.g and 0.4 p.g. An approximately stoichiometric 
amount of SsfXI digested vector was added (with prior 
removal of the small internal BsfXI fragment) and each 
class was Iigated separately. After E coli transformation 
by electroporatlon (strain MR 32), 7 x 10^ clones were 
obtained from the short class, 1 0^ clones from the medium 
class and 5.6 x 10^ clones from the longer class. Eight 
clones, chosen randomly in each class size, were analysed 
and the length of their cDNA insert measured after digestion 
by the restriction enzyme A/of I. The average insert size is 
0.7 kb for the short class, 1 .6 kb for the medium class and 
2.4 kb for the long class. The clones were then pooled 
together, an aliquot was frozen at -80*'C for future amplifi- 
cation and the remainder was used to prepare plasmid 
DNA in a caesium chloride gradient. 

Complementation ofS, cerevisiae mutants 

To test the cDNA bank, different auxotrophic strains of S. 
cerevisiae were transformed with purified plasmid DNA 
and prototrophic clones were selected for growth on 
appropriate medium. The strains used, the transformation 
method and the selection details are given in Experimental 
procedures. Eight different genes were tested for 
complementation: URA1, URA2, URA4, URA5-URA10, 
ADE2, HIS3. LEU2 and TRP1 ; all of the eight yeast mutants 
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Figure 2. Electrophoresis after Not\ digestion of A thaliana cDNA inserts 
complementing: A,ura1: B. ura2: C, ura4\ D. ura5~ura10\ E. ade2\ F, his3; 
G, feu2: H. trpi S. cerevisiae mutants. \: phage X digested by H/ndlll as 
molecular weight standard, p; pFL6l digested by Notl. 

were complemented by the pool. It has been shown 
(Bertauche etal, unpublished results) that the ura5-ura10 
complementing cDNA was also able to complement a ura3 
mutant indicating that, as expected from enzymological 
results (Doremus, 1986; Walther et af., 1984), plants 
possess a bifunctional protein having both orotate phospho- 
ribosyl transferase and orotidine 5'-phosphate decarboxy- 
lase activities. 

In the case of ade2, his3. Ieu2 and trp 1 complementation, 
the yeast strain also harboured a uraS marker, which 
allowed us to measure the total number of transformants 
(complementation of uraS by the URAS gene of the pFL61 
plasmid) and thus estimate the frequency of complementing 
cDNA, which ranged between 5 x 10"^ and 5 x 10~^. 
A similar number of complementing clones was observed 
in the case of ura4 and of ura5-ura10 (where the total 
number of transformants could not be measured). However 
this number was one and two orders of magnitude lower 
for the ural and ura2 complementation, respectively. 
Moreover, the ural and ura2 complementing clones grew 
very poorly (doubling time of 5 h on minimal medium 
compared to 2 h for a prototrophic wild-type strain), indica- 
tion a poor complementation. 

In each case, the occurrence of true complementation 
was confirmed by a loss of prototrophic phenotype 
accompanying loss of the plasmid, and by a systematic 
recovery of the prototrophic phenotype after retransfor- 
matlon by the purified plasmid. 

Figure 2 presents the size of the different complementing 
inserts after A/of I digestion. In view of our previous interest 
in the pyrimidine pathway we have chosen to study the ural 
complementing insert in more detail. The corresponding 
plasmid was called pPyrDARA. 
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CTACAAGCTTA 

AAACCCTCTTCTTATAGATTTCAAAGCAACCATTTTCGAACTCTGTTTGATTCCACTGTCGATTTAATCATCTTACCTA 
1/1 

ATG GCC GGA AGG GCT GCG ACG TCG TCG GCG AAA TGG GCG AGA GAG TTT TTG TTC AGA AGG 
Met ala gly arg ala ala thr ser ser ala lys trp ala arg glu phe Icu phe arg arg 
61/21 9\/3\ 

GTT TCG TCT AAT CCT CTT GGA GCT ACC CGT AAC TGT TCT TCG GTT CCT GGA GCT TOT TCT 
val ser ser asn pro leu gly ala thr arg asn cys ser ser val pro gly ala ser ser 
121/41 151/51 

GCA CCG AAA GTC CCT CAT TTT TCC AAG AGA GGA AGG ATA TTG ACA GGA CCT ACC ATT GGT 
ala pro lys val pro his phe ser lys arg gly arg lie leu thr gly ala thr lie gly 
181/61 211/71 

CTC GCC ATA GCT GGA GGA GCT TAT GTT ACT ACT GCA GAT GAA GCA ACC TTC TGT GGG TGG 
leu ala lie ala gly gly ala tyr val ser thr ala asp glu ala thr phe cys gly trp 
241/81 271/91 

CTA TTC AAC GCA ACA AAG GTT GTG AAT CCT TTC TTT GCT CTT CTG GAT GCT GAG TTT GCG 
leu phe asn ala thr lys val val asn pro phe phe ala leu leu asp ala glu phe ala 
301/101 331/111 

CAT AAG CTG GCT GTT TCA GCT GCA GCT CGC GGG TGG GTA CCT 'AGA GAA AAG AGG CCT GAT 
his lys leu ala val ser ala ala ala arg gly trp val pro arg glu lys arg pro asp 

361/121 391/131 

CCA GC^ ATC TTG GGA CTT GAA GTT TGG GGA AGG AAA TTC TCA AAC CCA ATT GGG CTT GCT 
pro ala lie leu gly leu glu val trp gly arg lys phe ser asn pro lie gly leu ala 
421/141 451/151 

CCT GGA TTT GAC AAA AAC GCT GAG GCT ACT GAG GGA CTG CTA GGG ATG GGA. TTT GGC TTT 
ala gly phe asp lys asn ala glu ala thr glu gly leu leu gly met gly phe gly phe 
481/161 511/171 

GTT GAG CTA GGC TCT GTA ACT CCT GTT CCA CAA GAA GGC AAT CCG AAA CCA CGT ATC TTC 
val qlu val gly ser val thr pro val pro gin glu gly asn pro lys pro arg lie phe 
541/181 571/191 

AGA CTA AGC CAA GAA GGA GCT ATT ATC AAT AGG TGT GGA TTT AAT AGC GAA GGG ATT GTT 
arg leu ser gin glu gly ala lie lie asn arg cys gly phe asn ser glu gly He val 
601/201 631/211 

GTC CTT GCA AAG CGC TTG GGT GCT CAA CAT GGT AAA AGA ATG TTG GCT GAA ACA TCA GCT 
val val ala lys arg leu gly ala gin his gly lys arg met leu aia glu thr ser ala 
661/221 691/231 

ACT TCG TCA TCT CCA ACT GAT CAT GTA AAA CCA GGG GGA AAA TCT GGA CCT GGT ATT CTT 
thr ser ser ser pro ser asp asp val lys pro gly gly lys ser gly pro gly He leu 
721/241 751/251 

GGG GTC AAC CTT GGA AAG AAC AAG ACG AGT GAG GAT GCT GCT GCT GAC TAT GTC CAA GGA 
gly val asn leu gly lys asn lys thr ser glu asp ala ala ala asp tyr val gin gly 
781/261 811/271 

GTT CAT AAC TTA TCC CAG TAT GCT GAT TAC TTG GTG ATC AAT GTT TCA TCA CCC AAT ACT 
val his asn leu ser gin tyr ala asp tyr leu val He asn val ser ser pro asn thr 
841/281 871/291 

GCA GGA CTG CGC ATG CTT CAA GGA AGG AAA CAG TTG AAG GAC CTT GTA AAG AAG GTT CAA 
ala gly leu arg met leu gin gly arg lys gin leu lys asp leu val lys lys val gin 
901/301 931/311 

GCT GCT CGG GAT GAG ATG CAA TGG GGT GAT GAA GGT CCT CCT CCT CTT CTT GTG AAG ATT 
ala ala arg asp glu met gin trp gly asp glu gly pro pro pro leu leu val lys lie 
961/321 991/331 

GCT CCT GAT CTG TCC AGA GGA GAG CTT GAA GAT ATT GCA GCG GTG GCT CTT GCT CTG CAC 
ala pro asp leu ser arg gly glu leu glu asp He ala ala val ala leu ala leu his 
1021/341 1051/351 

TTA GAT GGG CTG ATC ATA TCA AAT ACA ACA GTC TCA AGG CCT GAT GCT GTA AGC AAC AAC 
leu asp gly leu He He ser asn thr thr val ser arg pro asp ala val ser asn asn 
1081/361 1111/371 

CCC GTC GCA ACA GAA ACA GGT GGT TTG AGC GGG AAA CCG CTC TTT GCT CTC TCC ACC AAC 
pro val ala thr glu thr gly gly leu ser gly lys pro leu phe ala leu ser thr asn 
1141/381 1171/391 

ATG TTG AGA GAT ATG TAC ACT TTG ACA CCA GGA AAG ATT CCA TTG ATA GCC TCT GGC GGG 
met leu arg asp met tyr thr leu thr arg gly lys He pro leu He gly cys gly gly 
1201/401 1231/411 

GTT AGT AGT GGT GAG GAT GCT TAC AAG AAA ATA AGA GCT GGA GCT ACT CTT GTT CAG CTG 
val ser ser gly glu asp ala tyr lys lys He arg ala gly ala thr leu val gin leu 
1261/421 1291/431 

TAC ACG GGA TTT GCC TAT GGA GGA CCT GCC CCA TCC CAC AAA TAA AGGAGGAACTCGTGAAATG 
tyr thr gly phe ala tyr gly gly pro ala pro ser his lys OCH 

CTTAGAAAGGGATCGCTTCAAGTCGATCCATGAAGCAATTGGTGCTGATCACAGATGATAAAACTCAAAGAGCAATACG 
CTGCGGAAAGATCCCTTTTCTTAAAACAACAAAACAAGTTGTATTTAAAGTTCATTGATTCAGTTGTCTCTTTCTACCC 

CCAAC AATA.AC ATCATCGTTTTTGCC 
1508 

Figure 3. Nucleotide and deduced amino acid sequences of the 1 .6 kb cDNA insert. The possible polyadenylation signal AATAAC is underlined. 
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Sequence analysis 

The 1598 bp sequence of the PyrD ARA insert is given in 
Figure 3 (Genbank/EMBL accession number X62909). 
There is a single open reading frame from base 91 to base 
1395, corresponding to a 435 amino acid coding sequence 
from the first ATG onward and followed by a 203 bp tail 
with a potential adenylation signal (AAUAAC) (Wickens 
and Stefenson, 1984) 16 bp before the 3' end. No poly(A) 
tail was found, which is probably due to a reduction in the 
length of the poly(dA/dT) tail during replications in E. cofi. 

Comparison with the NBRF protein databank reveals 
homology with the product of four genes known to encode 
DHO dehydrogenase (EC 1.3,3.1) in Salmonella 
typhimurium (Frick etaL, 1990), E. co// (Larsen and Jensen, 
1985). S. cerevisiaeand Dictyostelium discoideum (Jacquet 
et aL, 1985). These sequences were compared by dot 
matrix analysis (Figure 4) and by local alignment with the 
LFASTA program (data not shown). These comparisons 
revealed that the first 130 amino acids are specific to 
A thaliana and that a small central sequence between 210 
and 240 amino acids is present In A, thaliana and absent 
In the bacterial sequences. Statistical analysis of the simi- 
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Figure 4- Dot matrix comparisons of DHO dehydrogenase annino acid 
sequences. 

Matrix of Dayhoff, ktup = 1, 100 shuffled sequences. Sources: E. coli 
(Larsen and Jensen, 1986). Similar results were obtained with S. typhimurium 
(Frick etai.. 1990). D. discoideum (Jacquet etaL. 1985) and S. cerevisiae 
(EMBL accession number F32729). 



larities between pairs of sequences using a ktup value of 
1 and 100 shuffled sequences, gave significant Z values 
of 34.7, 21.4, 8.6 and 5.7, respectively (see Lipman 
and Pearson, 1985 and Pearson and Lipman, 1988 for 
discussion of the statistical meaning of the Z value). 

The 58 amino acid N-termina! sequence has features 
typical of mitochondrial targeting signals, i.e. it is rich in 
positively charged and hydroxylated residues and lacks 
acidic residues (Hartl etaL, 1 989). This is consistent with 
biochemical data (Doremus and Jagendorf, 1985) indicating 
that plant DHO dehydrogenase is a nnitochondrial enzyme. 

Northern analysis 

Poly(A'^) RNAs from A thaliana seedlings were analysed 
by Northern blotting, using the PyrD cDNA insert obtained 
after A/of I digestion as a DNA probe. Two A. thaliana 
mRNAs of about 1 .8 kb and 3,4 kb were detected. The 3.4 
kb mRNA was about 10 times less abundant than the 1.8 
kb species. No signal was obtained in the yeast RNA 
preparation (strain FL100) used as a control (Figure 5), 

Discussion 

We have constructed an A. thaliana cDNA bank in the 
yeast expression vector pFL61 . This bank is very efficient 
in cloning plant cDNAs by complementation of yeast 
mutants. All the eight yeast mutants tested could be 
complemented by this bank. This success may be due to 
the fact that we restricted our assays to enzymes involved 
in biosynthetic pathways. However. cDNAs encoding the 
p450 reductase proteins (Pompon, personal communi- 
cation) and the transport protein (Sentenac et a/., 1992) 
have also been obtained from this bank by yeast mutant 
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Figure 5. Northern analysis of poly(A)* RNAs. 
Lane A. A thaliana, Jane B, S. cerevisiae. 
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complementation. Furthermore, Alderson et aL (1991) 
have recently complemented an sfnl mutant by a rye 
cDNA coding for a protein kinase. 

The poor complementation of the ura2 mutant lacking 
aspartate transcarbamylase may reflect a difference in the 
synthesis of carbamyl phosphate (CP) and Its regulation, 
in agreement with previous work indicating that in plants 
there is only one CP synthetase common to the arginine 
and pyrlmidine pathways (0*Neal and Naylor. 1976). The 
1 .6 kb cDNA insert (PyrD ARA) which complements the 
ura1 yeast mutant was analysed by Northern blot and 
sequencing. This cDNA hybridizes with two mRNAs 
of about 1.8 and 3.4 kb. The cDNA itself is only 1.6 kb 
long rather than 1,8 kb, which probably results from the 
shortening of the poIy(dA/dT) tail during replication in 
E. coll. The 3.4 kb mRNA has a low abundance. The 
simplest explanation is that both transcripts are synthesized 
from the same gene by alternative splicing. The existence 
of an homologous gene appears unlikely since only one 
Hf'ndWi DNA fragment (2.4 kb) is detected at high stringency, 
in a Southern blot experiment (data not shown). 

Amino acid sequence comparisons have shown that the 
plant DHO dehydrogenase displays much nnore sequence 
similarity to the bacterial enzyme than to the DHO dehydro- 
genase of eukaryotes such as S. cerevisiae and 
D. d/scoideum (Jacquet et a/., 1985). One explanation 
would be the occurrence of intergenic gene transfer 
between plants and bacteria. However, a more realistic 
hypothesis is convergent evolution where the plant and 
bacterial enzyme may have evolved in response to a 
common selective pressure. This would be consistent with 
the membrane-bound nature of the plant (Doremus and 
Jagendorf. 1985) and bacterial enzyme (Karibian, 1978), 
whereas DHO dehydrogenase appears to be soluble, at 
least in D. discoideum (Jacquet et a/.. 1985). 



Experimental procedures 

Strains, media and genetical techniques 

Seeds of Arabidopsis thaliana (L). Heynh (Landsberg erecta 
ecotype) originating from J. Giraudat (CNRS, Gif-sur-Yvette) were 
used. Surface-sterilized seeds were germinated on germination 
medium containing Murashige and Skoog (1 962) salt mixture. 1 % 
sucrose. 100 mg 1"^ inositol. 1.0 mg I"' thiamine, 0.5 mg 1"' 
pyridoxine. 0.5 mg nicotinic acid and 0.8% agar and were 
submitted to a 16 h light photoperiod. Seedlings were harvested 
at the two-leaf stage. 

The B. coii strain MR32 is a RecAl derivative of MCI 061 
(Casadaban and Cohen, 1980) constructed by E. Petrochilo 
(personal communication). Five S. cerevisiae strains were tested 
for complementation by the A thaliana pool. W303-1B {mat a, 
ura3-1, Ieu2'3,1i2. ttis 3-11,15. ade2'101, trypl-l, can") was 
obtained from R. Rothstein, Aura2 is a complete deletion of the 
URA2 gene (Bach, personal communication). The three other 
strains are Isogenic to the wild-type strain FL100 (ATCC.28583). 
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:^ura1 was constructed by deletion of the H/ndlll-BsfEII fragment 
in the URA1 gene (Roy. EMBL accession number F32729). ura4 
is a point mutation obtained by UV mutagenesis and uraS-uraW 
is as described in de Montigny et at. (1990). 

After spheroplast transformation (Hinnen et al., 1978). trans- 
formants prototrophic for leucine, histidlne. adenine, tryptophan 
or uracil were selected by plating on omission medium; for 
example. H/S* recombinants were selected on standard minimal 
medium (Sherman etal., 1986) supplemented with leucine, adenine 
and tryptophan. The incubation time at 30X varied from 3 to 10 
days. The plasmlds were recovered and amplified from individual 
transformants by subcloning and transformation In the E. coli 
strain MR32 using electroporation. In each selection a control 
plate without plasmid DNA showed no reversion of the recipient 
strain, which is in good agreement with the fact that we always 
found a plasmid in the colonies obtained with transforming DNA. 

Sequence analysis 

DNA sequences on both strands were determined by the Sanger 
dideoxynucleotide method using a Sequenase Sequencing kit 
(USB). We performed either single-strand sequencing from two 
subclones in M13 or double-strand sequencing on the original 
pPyrD ARA plasmid. 

Northern hybridization 

Northern blots of poly(A)' RNAs <4 jjtg) from A. thaliana and yeast 
cells were as described in Sambrook et at. (1989) with minor 
modifications. After denaturation with formaldehyde and electro- 
phoresis in a 1.2% agarose gel. the RNAs were transferred to 
nitrocellulose and hybridized to ^^P-labelled probes prepared by 
multiprime labelling. Nonspecific hybridization was washed away 
by a 30 min treatment at 60*0 in 0.1 % SOS and 0.2 x SSC. 
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